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Abstract. We develop a clear connection between deFinetti's theorem 
for exchangeable arrays (work of Aldous-Hoover-Kallenberg) and the 
emerging area of graph limits (work of Lovasz and many coauthors). 
Along the way, we translate the graph theory into more classical prob- 
ability. 



1. Introduction 

DeFinetti's profound contributions are now woven into many parts of 
probability, statistics and philosophy. Here we show how developments from 
deFinetti's work on partial exchangeability have a direct link to the recent 
development of a limiting theory for large graphs. This introduction first 
recalls the theory of exchangeable arrays (Section ll.ip . Then, the subject of 
graph limits is outlined (Section \1.2\i . Finally, the link between these ideas, 
which forms the bulk of this paper, is outlined (Section ll.3|) . 

1.1. Exchangeability, partial exchangeability and exchangeable ar- 
rays. Let {Xj}, 1 < i < oo, be a sequence of binary random variables. 
They are exchangeable if 

P(X l = ei ,...,X n = e n )= F(Xt = e CT(1) , . ..,X n = e a{n) ) 

for all n, permutations a G 6 n and all ej € {0, 1}. The celebrated represen- 
tation theorem says 

Theorem 1.1 (deFinetti). // 1 < i < oo, is a binary exchangeable 

sequence, then: 

(i) With probability 1, X^ = lim - {X\ + • • • + X n ) exists. 

(ii) If n{A) = P{X 00 € A}, then for all n and e^, 1 < i < n, 

F(X 1 = e 1 ,...,X n = e n )= f x s (l - x) n - s fi(dx) (1.1) 

Jo 

for s = ei H h e n . 

It is natural to refine and extend deFinetti's theorem to allow more general 
observables (JQ with values in a Polish space) and other notions of symmetry 
(partial exchangeability). A definitive treatment of these developments is 
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given in Kallenberg [la ]. Of interest here is the extension of deFinetti's 
theorem to two-dimensional arrays. 

Definition. Let {Xij}, 1 < i,j < oo, be binary random variables. They 
are separately exchangeable if 



P(Xy = eij , 1 < i,j < n) = nXij = e a{l)T{j) , 1 < i,j < n) (1.2) 



for all n, all permutations u, r € & n and all e^ € {0, 1}. They are (jointly) 
exchangeable if (|1.2[) holds in the special case r = cr. 

Equivalently, the array {-Xy} is jointly exchangeable if the array {Xg.^yy ) } 
has the same distribution as {-Xy} for every permutation <r of N, and simi- 
larly for separate exchangeability. 

The question of two-dimensional versions of deFinetti's theorem under 
(separate) exchangeability arose from the statistical problems of two-way 
analysis of variance. Early workers expected a version of (ll.lj) with perhaps a 
two-dimensional integral. The probabilist David Aldous [l[ and the logician 
Douglas Hoover [3] found that the answer is more complicated. 




Define a random binary array {Xij} as follows: Let Ui, Vj, 1 < i,j < oo, 
be independent and uniform in [0, 1]. Let W(x, y) be a function from [0, l] 2 
to [0, 1]. Let be 1 or as a W(Ui, Vj)-coin comes up heads or tails. Let 
Pw be the probability distribution of {X^}, 1 < i, j < oo. The family {X^} 
is separately exchangeable because of the symmetry of the construction. The 
Aldous-Hoover theorem says that any separately exchangeable binary array 
is a mixture of such Pw. 

Theorem 1.2 (Aldous-Hoover). Let X = {X^}, 1 < i,j < oo, be a sepa- 
rately exchangeable binary array. Then, there is a probability \x such that 



There is a similar result for jointly exchangeable arrays. 

The uniqueness of \i resisted understanding; if W is obtained from W by 
a measure-preserving change of each variable, clearly the associated process 
{Xij} has the same joint distribution as {Xij}. Using model theory, Hoover 
[141 ] was able to show that this was the only source of non- uniqueness. A 
'probabilist's proof was finally found by Kallenberg, see 0, Sect. 7.6] for 
details and references. 

These results hold for higher dimensional arrays with X^ taking values in 
a Polish space with minor change [15, Chap. 7]. The description above has 
not mentioned several elegant results of the theory. In particular, Kallen- 
berg's 'spreadable' version of the theory replaces invariance under a group 
by invariance under subsequences. A variety of tail fields may be introduced 
to allow characterizing when W takes values in {0,1} [id . Sect. 4]. Much 
more general notions of partial exchangeability are studed in [ll|] . 
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1.2. Graph limits. Large graphs, both random and deterministic, abound 
in applications. They arise from the internet, social networks, gene regula- 
tion, ecology and in mathematics. It is natural to seek an approximation 
theory: What does it mean for a sequence of graphs to converge? When can 
a large complex graph be approximated by a small graph? 

In a sequence of papers 0, Q, S, I, EE 0, E3, [UlH M, E3, Ejj , Laszlo 
Lovasz with coauthors (listed here in order of frequency) V. T. Sos, B. 
Szegedy, C. Borgs, J. Chayes, K. Vesztergombi, A. Schrijver, M. Freedman 
have developed a beautiful, unifying limit theory. This sheds light on topics 
such as graph homomorphisms, Szemeredi's regularity lemma, quasi-random 
graphs, graph testing and extremal graph theory. Their theory has been 
developed for dense graphs (number of edges comparable with the square of 
number of vertices) but parallel theories for sparse graphs are beginning to 
emerge [3]. 

Roughly, a growing sequence of finite graphs G n converges if, for any fixed 
graph F, the proportion of copies of F in G n converges. Section [2] below 
has precise definitions. 

Example 1.3. Define a probability distribution on graphs on n-vertices as 
follows. Flip a #-coin for each vertex (dividing vertices into 'boys' and 'girls'). 
Connect two boys with probability p. Connect two girls with probability p'. 
Connect a boy and a girl with probability p" . Thus, if p = p' = 0, p" = 1, we 
have a random bipartite graph. If p = p' = 1, p" = 0, we have two disjoint 
complete graphs. If p = p' = p" , we have the Erdos-Renyi model. As n 
grows, these models generate a sequence of random graphs which converge 
almost surely to a limiting object described below. 

More substantial examples involving random threshold graphs are in [121 ] . 

If a sequence of graphs converges, what does it converge to? For exchange- 
able random graphs (defined below), there is a limiting object which may 
be thought of as a probability measure on infinite random graphs. Sup- 
pose W(x,y) = W(y,x) is a function from [0, l] 2 — > [0,1]. Choose {Ui}, 
1 < i < oo, independent uniformly distributed random variables on [0,1]. 
Form an infinite random graph by putting an edge from i to j with probabil- 
ity W(Ui,Uj). This measure on graphs (or alternatively W) is the limiting 
object. 

For the "boys and girls" example above, W may be pictured as 
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The theory developed shows that various properties of G ri can be well 
approximated by calculations with the limiting object. There is an elegant 
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characterization of these 'continuous graph properties' with applications to 
algorithms for graph testing (Does this graph contain an Eulerian cycle?) or 
parameter estimation (What is an approximation to the size of the maximum 
cut?). There is a practical way to find useful approximations to a large graph 
by graphs of fixed size 0]. This paper also contains a useful review of the 
current state of the theory with proofs and references. 

We have sketched the theory for unweighted graphs. There are general- 
izations to graphs with weights on vertices and edges, to bipartite, directed 
and hypergraphs. The sketch leaves out many nice developments. For exam- 
ple, the useful cut metric between graphs [19( and connections to statistical 
physics [§]. 

1.3. Overview of the present paper. There is an apparent similarity 
between the measure P\y of the Aldous-Hoover theorem and the limiting 
object W from graph limits. Roughly, working with symmetric W gives 
the graph limit theory; working with general W gives directed graphs. The 
main results of this paper make these connections precise. 

Basic definitions are in Section [2] which introduces a probabilist's version 
of graph convergence equivalent to the definition using graph homomor- 
phisms. Section [3] uses the well-established theory of weak convergence of 
a sequence of probability measures on a metric space to get properties of 
graph convergence. Section U] carries things over to infinite graphs. 

The main results appear in Section This introduces exchangeable 
random graphs and gives a one-to-one correspondence between infinite ex- 
changeable random graphs and distributions on the space of proper graph 
limits (Theorem 15 . 3[) . which specializes to a one-to-one correspondence be- 
tween proper graph limits and extreme points in the set of distributions of 
exchangeable random graphs (Corollary I5.4[) . 

A useful characterization of the extreme points of the set of exchange- 
able random graphs is in Theorem 15.51 These results are translated to the 
equivalence between proper graph limits and the Aldous-Hoover theory in 
Section [6j The non-uniqueness of the representing W, for exchangeable 
random graphs and for graph limits, is discussed in Section [7J 

The equivalence involves symmetric W(x,y) and a single permutation a 
taking W(Ui,Uj) to W(U a ^, U a ^). The original Aldous-Hoover theorem, 
with perhaps non-symmetric W(x, y) and W(Ui, Vj) to W(U a /{\, V T (j)) trans- 
lates to a limit theorem for bipartite graphs. This is developed in Sectional 
The third case of the Aldous-Hoover theory for two-dimensional arrays, 
perhaps non-symmetric W(x, y) and a single permutation u, corresponds to 
directed graphs; this is sketched in Section [H 

The extensions to weighted graphs are covered by allowing Xij to take 
general values in the Aldous-Hoover theory. The extension to hypergraphs 
follows from the Aldous-Hoover theory for higher-dimensional arrays. (The 
details of these extensions are left to the reader.) 
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Despite these parallels, the theories have much to contribute to each other. 
The algorithmic, graph testing, Szemeredi partitioning perspective is new 
to exchangeability theory. Indeed, the "boys and girls" random graph was 
introduced to study the psychology of vision in Diaconis-Freedman (1981). 
As far as we know, its graph theoretic properties have not been studied. The 
various developments around shell-fields in exchangeability, which character- 
ize zero/one W(x,y), have yet to be translated into graph-theoretic terms. 

Acknowledgements. This lecture is an extended version of a talk pre- 
sented by PD at the 100th anniversary of deFinetti's birth in Rome, 2006. 
We thank the organizers. This work was partially funded by the French 
ANR's Chaire d'excellence grant to PD. 

SJ thanks Christian Borgs and Jennifer Chayes for inspiration from lec- 
tures and discussions during the Oberwolfach meeting 'Combinatorics, Prob- 
ability and Computing', held in November, 2006. Parts of the research were 
completed during a visit by SJ to the Universite de Nice - Sophia Antipolis 
in January 2007. 

2. Definitions and basic properties 

All graphs will be simple. Infinite graphs will be important in later sec- 
tions, but will always be clearly stated to be infinite; otherwise, graphs 
will be finite. We denote the vertex and edge sets of a graph G by V(G) 
and E(G), and the numbers of vertices and edges by v(G) := |V(G)| and 
e(G) := \E(G)\. We consider both labelled and unlabelled graphs; the labels 
will be the integers 1, . . . , n, where n is the number of vertices in the graph. 
A labelled graph is thus a graph with vertex set [n] := {1, . . . , n} for some 

n > 1; we let C n denote the set of the 2(2) labelled graphs on [n] and let 
C := U^Li An unlabelled graph can be regarded as a labelled graph 
where we ignore the labels; formally, we define U n , the set of unlabelled 
graphs of order n, as the quotient set C n / = of labelled graphs modulo iso- 
morphisms. We let IA := UnLi^n = ^1 — > the set of all unlabelled graphs. 

Note that we can, and often will, regard a labelled graph as an unlabelled 
graph. 

If G is an (unlabelled) graph and v± , . . . , is a sequence of vertices in G, 
then G(vi, . . . ,Vk) denotes the labelled graph with vertex set [k] where we 
put an edge between i and j if Vi and vj are adjacent in G. We allow the 
possibility that Vi = Vj for some % and j. (In this case, there is no edge ij 
because there are no loops in G.) 

We let G[k], for k > 1, be the random graph G(vi, . . . ,Vk) obtained by 
sampling v\,...,Vk uniformly at random among the vertices of G, with re- 
placement. In other words, v 1, . . . , Vk are independent uniformly distributed 
random vertices of G. 

For k < v(G), we further let G[k]' be the random graph G(v[, . . . ,v' k ) 
where we sample v[,. . . ,v' k uniformly at random without replacement; the 
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sequence v[, . . . ,v' k is thus a uniformly distributed random sequence of k 
distinct vertices. 

The graph limit theory in [3] and subsequent papers is based on the study 
of the functional t(F, G) which is defined for two graphs F and G as the 
proportion of all mappings V{F) — ► V(G) that are graph homomorphisms 
F — > G, i.e., map adjacent vertices to adjacent vertices. In probabilistic 
terms, t(F,G) is the probability that a uniform random mapping V{F) — > 
V(G) is a graph homomorphism. Using the notation introduced above, we 
can, equivalently, write this as, assuming that F is labelled and k = v(F), 

t(F, G) := W(F C G[k]) . (2.1) 

Note that both F and G[k] are graphs on [k], so the relation F C G[fc] is 
well-defined as containment of labelled graphs on the same vertex set, i.e. 
as E(F) C E(G[k]). Although the relation F C may depend on the 
labelling of F, the probability in (|2.1h does not, by symmetry, so t(F, G) is 
really well defined by (12. ID for unlabelled F and G. 

With F, G and k as in (|2.ip . we further define, again following [l^] (and 
the notation of 0]) but stating the definitions in different but equivalent 
forms, 

t in j(F,G):=F(FCG[k]') (2.2) 

and 

t iad (F,G) :=W(F = G[k}'), (2.3) 

provided F and G are (unlabelled) graphs with v(F) < v{G). If v (F) > v(G) 
we set t inj (F, G) := i ind (F, G) := 0. 

Since the probability that a random sample v\ , . . . , Vf. of vertices in G 



contains some repeated vertex is < k /(2v(G)), it follows that 19|] 



\t(F,G)-t- mi (F,G)\ <|S- (2-4) 

Hence, when considering asymptotics with u(Gr) — > oo, it does not matter 
whether we use t or i in j. Moreover, if F G then, as pointed out in 
and I3, 

t inj (F,G)= J2 tindC^.G) (2.5) 

F'g£ fc , F'DF 

and, by inclusion-exclusion, 

iind(F,G)= ^ (-l) e ( F ')- £ ( F )t inj (F,G). (2.6) 

F'e£ fc , F'DF 

Hence, the two families {tj n j(.F, -)}FeU an d {iind(F, -)}_FeW of graph func- 
tionals contain the same information and can replace each other. 

The basic definition of Lovasz and Szegedy [19[] and Borgs, Chayes, Lovasz, 
Sos and Vesztergombi [§] is that a sequence (G n ) of graphs converges if 
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t(F, G n ) converges for every graph F. We can express this by considering 
the map r : U -» [0, if defined by 

T(G):=(t(F,G)) F& i€[0,l] u . (2.7) 

Then (G n ) converges if and only if r(G n ) converges in [0, if , equipped with 
the usual product topology. Note that [0, if is a compact metric space; as 
is well known, a metric can be defined by, for example, 

oo 

d{(x F ),(y F )) :=^2" i |x Fi -y Fi |, (2.8) 
i=0 

where F\, F2, . . . is some enumeration of all unlabelled graphs. 

We define W := t(U) C [0, if to be the image of U under this mapping 
r, and let IA* be the closure of IA* in [0, if . Thus IA* is a compact metric 
space. (For explicit descriptions of the subset IA* of [0, if as a set of graph 
functional, see Lovasz and Szegedy (l9j|.) 

As pointed out in [lj| and [g] (in equivalent terminology), r is not in- 
jective; for example, T(K n „) is the same for all complete bipartite graphs 
K n ^ n . Nevertheless, as in and we can consider a graph G as an 
element of IA* by identifying G and t(G) (thus identifying graphs with the 
same t(G)), and then convergence of (G n ) as defined above is equivalent to 
convergence in IA* . The limit is thus an element of I A* , but typically not a 
graph in IA* . The main result of Lovasz and Szegedy [19| is a representation 
of the elements in IA* to which we will return in Section [6J 

Remark 2.1. As said above, IA* is a compact metric space, and it can 
be given several equivalent metrics. One metric is the metric (|2.8|) inher- 
ited from [0, if, which for graphs becomes d(G,G') = J2i ^~ l \t(Fi, G) — 
t(Fi, G')\. Another metric, shown by Borgs, Chayes, Lovasz, Sos and Veszter- 
gombi to be equivalent, is the cut-distance 5\j, see @] for definitions. Fur- 
ther characterizations of convergence of sequences of graphs in IA are given 
in Borgs, Chayes, Lovasz, Sos and Vesztergombi |8j, |9(]. 

The identification of graphs with the same image in IA* (i.e., with the 
same t(F, •) for all F) is sometimes elegant but at other times inconvenient. 
It can be avoided if we instead let IA + be the union of IA and some one-point 
set {*} and consider the mapping r + :U->[0, lf + = [0, if x [0, 1] defined 
by 

t+(G) = (t(G),v(G)- 1 ). (2.9) 

Then r + is injective, because if r{G{) = t(C?2) for two graphs G\ and G2 
with the same number of vertices, then G\ and G2 are isomorphic and thus 
G\ = G2 as unlabelled graphs. (This can easily be shown directly: it follows 

from (12.11) that G%[k] = Gi\k\ for every k, which implies G\[k]' = G2[k] f for 
every k < v{G\) = v (G2); now take k = v(Gi). It is also a consequence of 
[1, Theorem 2.7 and Theorem 2.3 or Lemma 5.1].) 
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Consequently, we can identify IA with its image r + (IA) C [0,l] w and 
define IA C [0, as its closure. It is easily seen that a sequence (G n ) of 
graphs converges in IA if and only if either v(G n ) — > oo and (G n ) converges in 
IA* , or the sequence (G n ) is constant from some no on. Hence, convergence 
in IA is essentially the same as the convergence considered by by Lovasz 
and Szegedy [19], but without any identification of non-isomorphic graphs 
of different orders. 

Alternatively, we can consider T ia ^ or r inc j defined by 

Tinj(G) :=(t inj (F,G)) FeU G [0,1]", 
Tind(G) :=(t ind (F,G)) FeU G [0, 1} U . 

It is easy to see that both and T m d are injective mappings IA — > [0, l] w . 
(If r inj (F,Gi) = T in j(F,G 2 ) for all F, we take F = G\ and F = G 2 and 
conclude Gi = G*2, using our special definition above when v(F) > v(G).) 
Hence, we can again identify IA with its image and consider its closure IA 
in [0,l] w . Moreover, using (|2.4|) . (|2.5|) . and (|2.6|) . it is easily shown that if 
(Cr n ) is a sequence of unlabelled graphs, then 

T + {G n ) converges -s=>- T m d(G n ) converges T; n j(G n ) converges. 

Hence, the three compactifications t + (IA), T m j(IA), Ti n d(lA) are homeomorphic 
and we can use any of them for IA. We let Uoo :=U\U; this is the set of all 
limit objects of sequences (G n ) in IA with v(G n ) — > cxd. (I.e., it is the set of 
all proper graph limits.) 

We will in the sequel prefer to use IA rather than IA* , thus not identifying 
some graphs of different orders, nor identifying finite graphs with some limit 
objects in Uoo. 

For every fixed graph F, the functions t(F, •), ti n i(F, •) and ti n d(F, •) have 
unique continuous extensions to IA, for which we use the same notation. We 
similarly extend f(-) continuously to IA by defining v(G) = 00 and thus 
v{G)^ = for G € Hoc := U \ U. Then jjlOjl . <|23|> and ([22]) hold for all 
G £lA, where ([2.4p means that 

t- mi {F,G)=t(F,G), GelA^. (2.10) 

Note that is a compact metric space. Different, equivalent, metrics 
are given by the embed. dings t~^, Tinj? 'Hnd 

into [0, l] w+ and [0, l] w . Another 
equivalent metric is, by Remark 12.11 and the definition of r + , 5\j(Gi,G2) + 
\v(G 1 )- 1 -v(G 2 )- 1 \. 

We summarize the results above on convergence. 

Theorem 2.1. A sequence (G n ) of graphs converges in the sense of Lovasz 
and Szegedy fldi] if and only if it converges in the compact metric space IA* . 
Moreover, if v{G n ) — > 00, the sequence (G n ) converges in this sense if and 
only if it converges in IA. 
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The projection vr : [0, 1} U+ = [0, if x [0, 1] -> [0, l]^_maps t+(G) to t{G) 
for every graph G, so by continuity it maps U into U*. For graph G £ W, 
7r(G) = r(G) is the object in U* corresponding to G considered above, and 
we will in the sequel denote this object by vr(G); recall that this projection 
U — > U* is not injective. (We thus distinguish between a graph G and its 
"ghost" 7t(GQ in Recall that when graphs are considered as elements of 
U* as in [l^] and 0], certain graphs are identified with each other; we avoid 
this.) On the other hand, an element G of IA is by definition determined by 
r(G) and v(G)^ 1 , cf. (|2.9p . so the restriction 7r : U n — > W* is injective for 
each n < oo. In particular, 7r : Uoo — > is injective. Moreover, this map is 
surjective because every element G G IA* is the limit of some sequence (G n ) 
of graphs in IA with f(G n ) — » oo; by Theorem 12. 1\ this sequence converges 
in U to some element G", and then n(G') = G. Since Uoo is compact, the 
restriction of ir to Woo is thus a homeomorphism, and we have the following 
theorem, saying that we can identify the set Uoo of proper graph limits with 
W. 

Theorem 2.2. The projection ir maps the set Uoo :=IA\IA of proper graph 
limits homeomorphically onto IA* . 

3. Convergence of random graphs 

A random unlabelled graph is a random element of IA (with any distribu- 
tion; we do not imply any particular model). We consider convergence of 
a sequence (G n ) of random unlabelled graphs in the larger space U; recall 
that this is a compact metric space so we may use the general theory set 
forth in, for example, Billingsley 0]. 

We use the standard notations — — — for convergence in distribu- 
tion, probability, and alsmost surely, respectively. We will only consider the 
case when v(G n ) — > oo, at least in probability. (The reader may think of 
the case when G n has n vertices, although that is not necessary in general.) 

We begin with convergence in distribution. 

Theorem 3.1. Let G n , n > 1, be random unlabelled graphs and assume 
that v(G n ) — > oo. The following are equivalent, as n — > oo. 

(i) G n — — > r for some random T £lA. 

(ii) For every finite family F±, . . . , F m of (non-random) graphs, the ran- 
dom variables t(F\, G n ), . . . , t(F m , G n ) converge jointly in distribu- 
tion. 

(iii) For every (non-random) F GlA, the random variables t(F,G n ) con- 
verge in distribution. 

(iv) For every (non-random) F G IA, the expectations Wjt(F,G n ^) con- 
verge. 



If these properties hold, then the limits in (ii) (iii) and (iv) are T)) ._, , 



t(F,T) andMt(F,T), respectively. Furthermore, T S Uoo o,.s. 
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The same results hold if t is replaced by t- m j or t m d . 

Proof, (i) <^=^ (ii) Since U is a closed subset of [0, , convergence in dis- 
tribution in hi is equivalent to convergence of r + (G n ) = ((t(F,G n ))Feu, f(C n ) _1 ) 
in [0, , Since we assume v(G n )~' 1 — 0, this is equivalent to convergence 
of (t(F, G n )) FeU m [0) 1] W @> Theorem 4.4], which is equivalent to conver- 
gence in distribution of all finite families (t(Fi,G n )) 1 ^ =1 . 
Trivial. 

Immediate, since t is bounded (by 1). 



in 



iv. 



in 



iv 



> (ii) Let Fx, . . . , F m be fixed graphs and let ix, ■ ■ ■ 
integers. Let F be the disjoint union of li copies of Fi, i = '. 
for every G £hi, from the definition of t, 

m 

t(F,G) =Y[t(Fi,G) l \ 



t m be positive 
. . . , m. Then, 



i=X 



and hence 



EjJt(F i5 G)^ = Et(F, G). 



(3.1) 



Consequently, if (iv) holds, then every joint moment IEF]™ 1 t(Fi, G) * of 
t(Fx, G n ), . . . , t{F m , G n ) converges. Since t(Fi, G n ) are bounded (by 1), this 
implies joint convergence in distribution by the method of moments. 

The identification of the limits is immediate. Since v{G n ) — oo, (i) 
implies that v(T) = oo a.s., and thus T G U^. 

Finally, it follows from (|2.4p . (|2.5p and (|2.6p that we can replace t by 



or ti nc j in (ii) and (iv) , and the implications 



in 



and (iii) => (iv) 



are immediate for t- m ] and too 



□ 



Specializing to the case of a non-random limit G G Uqo, we obtain the 
corresponding result for convergence in probability. 

Corollary 3.2. Let G n , n > 1, be random unlabelled graphs such that 
v(G n ) — > oo, and let G G Uoo- The following are equivalent, as n — > oo. 

(i) G n G. 

(ii) t(F,G n ) — ^ t(F,G) for every (non-random) F G hi. 

(iii) Et(F, G n ) — ► t(-F, G) /or every (non-random) F £hi. 

The same result holds if t is replaced by t{ n $ or t- m ^. 

Note further that under the same assumptions, it follows directly from 
Theorem O that G n ^> G if and only if t(F,G n ) t(F, G) for every 

Few. 

We observe another corollary to Theorem 13.11 (and its proof). 
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Corollary 3.3. If T is a random element ofUoo = U\U = U* , then, for 
every sequence Fx, . . . , F m of graphs, possibly with repetitions, 

m 

Ej]i(F i ,r)=Ei(©^ 1 F i ,r), (3.2) 

i=i 

where ©^-^i denotes the disjoint union of F\, . . . ,F m . As a consequence, 
the distribution ofT is uniquely determined by the numbers Kt(F, T), F € U. 
Alternatively, the distribution of T is uniquely determined by the numbers 
Et- md (F,T), FeU. 

Proof. Since U is dense in U D W^, there exists random unlabelled graphs 

G n such that G n F. In particular, G n T and v(G n ) — oo (in fact, 
we may assume v(G n ) = n), so Theorem 13,11 and its proof apply, and (|3.2p 
follows from (13, ip applied to G n by letting n — > oo. 

For the final statement, note that (|3.2p shows that the expectations 
Et(F,T), F eW, determine all moments Ef]™! t(Fi,T), and thus the joint 
distribution of t(F, V), F € U, which is the same as the distribution of 
r(r) = (t(F, r)) FgW € [0, and we have defined Uoo such that we iden- 
tify r and r(r). Finally, the numbers Eti n d(i ? , T), F € Lt, determine all 
Ei(F,r) by (1231), recalling that t inj (F,T) = t(F, T) by (l2~lTl . □ 

Remark 3.1. The numbers Ei(F, T) for a random T £ Uoo thus play a 
role similar to the one played by moments for a random variable. (And 
the relation between Kt(F, T) and Et; n( j(i 71 , T) has some resemblance to the 
relation between moments and cumulants.) 

4. Convergence to infinite graphs 

We will in this section consider also labelled infinite graphs with the 
vertex set N = {1, 2, . . . }. Let denote the set of all such graphs. These 
graphs are determined by their edge sets, so Coo can be identified with the 
power set V(E(K QO )) of all subsets of the edge set E(Koo) of the complete 
infinite graph Koo, and thus with the infinite product set {0, We 
give this space, and thus Coo, the product topology. Hence, Coo is a compact 
metric space. 

It is sometimes convenient to regard C n for a finite subset of Coo'- 

we can identify graphs in C n and Coo with the same edge set. In other 
words, if G € C ri is a graph with vertex set [n], we add an infinite number 
of isolated vertices n + 1, n + 2, . . . to obtain a graph in Coo- 

Conversely, if H € Coo is an infinite graph, we let i2"|r n i € C n be the 
induced subgraph of H with vertex set [n] . 

If G is a (finite) graph, let G be the random labelled graph obtained by 
a random labelling of the vertices of G by the numbers 1, . . . ,v(G). (If G 
is labelled, we thus ignore the labels and randomly relabel.) Thus G is a 
random finite graph with the same number of vertices as G, but as just said, 
we can (and will) also regard G as a random graph in Coo- 
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We use the same notation G also for a random (finite) graph G given a 
random labelling. 

Theorem 4.1. Let (G n ) be a sequence of random graphs in U and assume 
that v(G n ) — — > oo. Then the following are equivalent. 

(i) G n — — > F inhl for some random F & U. 

(ii) G n — : > in /or some random H G C^. 

If these hold, then F(H\ [k] = F) = Ei ind (F,r) for every F G C k . Further- 
more, F G Uoo a.s. 

Proof. Let G be a labelled graph and consider the graph C?|ny, assuming /c < 
This random graph equals G[k}' = G(v[, . . . , v' k ), where v[, . . . ,v' k are 
k vertices sampled at random without replacement as in Section [2l Hence, 
by (USD, for every F G C k , 

¥(G\ [k] = F)=t ind (F,G), ifk< v(G). 

Applied to the random graph G n , this yields 

Et ind (F,G n ) <nGn\ [k ] =F) <Et ind (F,G n )+P{v(G n ) < k). (4.1) 

By assumption, P (v(G n ) < k) — > as n — > 00, and it follows from (|4.ip and 

Theorem 13.11 that G n r in U if and only if 

F(G n \ [k] =F)^Et ind (F,F) (4.2) 

for every > 1 and every F £ C k . 

Since is a finite set, (14. 2 p says that, for every k, G n \m — ► H k for 
some random graph H k G £^ with P(i7fe = F) = Ei ind (F, T) for F G 

Since has the product topology, this implies G n —> in for some 
random H G £tx> with = H k . 

Conversely, if G n H in C^, then Gr n |[fc] ^1[fc] so ^ ne argument 
above shows that 

Et ind (F,G n )=¥(G n \ [k] = F) + o(l)^F(H\ [k] = F) 

as n — ► co, for every F £ C k , and Theorem 13.11 yields the existence of some 

random r G Woo C U with G n F and Et ind (F, T) = F(H\ [k] = F). □ 

5. Exchangeable random graphs 

Definition. A random infinite graph H G Coo is exchangeable if its distribu- 
tion is invariant under every permutation of the vertices. (It is well-known 
that it is equivalent to consider only finite permutations, i.e., permutations 
a of N that satisfy o~(i) = i for all sufficiently large i, so a may be regarded 
as a permutation in <3 n for some n.) 

Equivalently, if X{j := l[ij G H] is the indicator of there being an edge 
ij in H, then the array {Xij}, 1 < i,j < 00, is (jointly) exchangeable as 
defined in Section [0 
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Lemma 5.1. Let H be a random infinite graph in Coo. Then the following 
are equivalent. 

(i) H is exchangeable. 

(ii) has a distribution invariant under all permutations of [k], for 
every k > 1. 

(iii) P(i?|rju = F) depends only on the isomorphism type of F, and can 
thus be seen as a function of F as an unlabelled graph in Uf., for 
every k > 1. 

Proof, (i) ==> (ii). Immediate. 

(ii) ==> (i). If a is a finite permutation of N, then a restricts to a per- 
mutation of [k] for every large k, and it follows that if H o a is H with the 

vertices permuted by a, then, for all large k H o = H\^] ° o~ = H\[kh 

which implies H o a = H. 

(ii) (iii). Trivial. □ 

Theorem 5.2. The limit H is Theorem \4-l\ is exchangeable. 

Proof. H satisfies Lemma l5.1( iii). □ 

Moreover, Theorem 14.11 implies the following connection with random 
elements of U r 



oo • 



Theorem 5.3. There is a one-to-one correspondence between distributions 
of random elements V G IAqq (orU*) and distributions of exchangeable ran- 
dom infinite graphs H G given by 

Et- md (F,T)=F(H\ [k] = F) (5.1) 
for every k > 1 and every F € Ck, or, equivalently, 

E t(F, T) = F(H D F) (5.2) 

for every F E C Furthermore, H\\ n \ -—* Y inlA as n — > oo. 

Proof. Note first that (15. ID and (15. 2|) are equivalent by (12. 5|) and (12. 6h . since 
i(F,r) = t inj (F,r) by (f2TT0]h and H D F if and only if F| [fe] D F when 

Suppose that T is a random element of Woo C U. Since W is dense in U, 
there exist ( as in the proof of Corollary 13. 3|) random unlabelled graphs Gn 

such that G n ^* T in U and thus v(G n ) ^> oo and G n —* V. Hence, 

Theorems 14.11 and 15.21 show that G n — > -fT for some random exchange- 



able infinite graph H satisfying (|5.ip . Furthermore, (|5.ip determines the 
distribution of H|rw for every fc, and thus the distribution of k. 

Conversely, if H is an exchangeable random infinite graph, let G n — 
-ff|r n i. By Lemma I5.1( ii). the distribution of each G n is invariant under 



permutations of the vertices, so if G n is G n with a random (re)labelling, 
we have G n = G n . Since G n H in (because has a product 
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topology), we thus have G n — > H in C^, so Theorem 14.11 applies and 

shows the existence of a random T G Z^oo such that G n — — > T and (|5.ip 
holds. Finally (15. ip determines the distribution of T by Corollary 13.31 □ 

Remark 5.1. Moreover, fl"|r n i converges a.s. to some random variable T G 
Uoo, because imd(F i?|[ n i), n > v(F), is a reverse martingale for every F G F. 
Alternatively, this follows by concentration estimates from the representa- 
tion in Section [U see Lovasz and Szegedy |1_9|, Theorem 2.5]. 

Corollary 5.4. There is a one-to-one correspondence between elements F 
of Uoo — and extreme points of the set of distributions of exchangeable 
random infinite graphs H G Coo- This correspondence is given by 

t(F, F) = P(H D F) (5.3) 

for every F G C. Furthermore, HU n -t •^-4 F inlA as n — > oo. 

Proof. The extreme points of the set of distributions on Uoo are the point 
masses, which are in one-to-one correspondence with the elements oiUoo- □ 

We can characterize these extreme point distributions of exchangeable 
random infinite graphs as follows. 

Theorem 5.5. Let H be an exchangeable random infinite graph. Then the 
following are equivalent. 

(i) The distribution of H is an extreme point in the set of exchangeable 
distributions in Coo. 

(ii) If Fx and F2 are two (finite) graphs with disjoint vertex sets V(F\), 
V(F 2 ) C N, then 

¥{H DFiU F a ) = ¥(H D Fi) F(H D F 2 ). 

(iii) The restrictions H\\k] and ff|[fc+i )0 o) are independent for every k. 

(iv) Let T n be the a-field generated by H\r niOC \. Then the tail a-field 
HnLi is trivial, i.e., contains only events with probability or 1. 

Proof, (i) ==^ (ii). By Corollary 15.41 H corresponds to some (non-random) 
r G Uoo such that 

P(H D F) = t(F, F) (5.4) 

for every F G C. We have defined C such that a graph F G C is labelled 
by 1, . . . ,v(F), but both sides of (|5.4p are invariant under relabelling of F 
by arbitrary positive integers; the left hand side because H is exchangeable 
and the right hand side because t(F, F) only depends on F as an unlabelled 
graph. Hence (|5.4p holds for every finite graph F with V(F) C N. 

Furthermore, since F is non-random, Corollary 13.31 yields t(F% UF2,T) = 
t(Fi,r)t(F 2 ,r). Hence, 

F(H d Fx U F 2 ) = t{F x U F 2 ,F) = i(F 1; F)t{F 2 ,F) = P(H D F 1 )F(H D F 2 ). 

(ii) (iii). By inclusion-exclusion, as for (|2.3p . (ii) implies that if 
1 < k < I < 00, then for any graphs F± and F 2 with V{F\) = {1, . . . , k} 
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and V(i*2) = {k + 1, . . . , k + /}, the events HUu = F\ and H \{k+i,...,i} = F% 
are independent. Hence and H\ik,...,l} are independent for every / > k, 
and the result follows. 

(iii) =^ (iv). Suppose A is an event in the tail cr-field P)^Li^n- Let T* 



be the cr-field generated by HU^. By (iii) , A is independent of T* for every 



re, and thus of the cr-field T generated by |J J-*, which equals the cr-field T\ 
generated by H. However, A E J-\, so A is independent of itself and thus 
F(A) = or 1. 

(iv) (i). Let F £ £k for some k and let F n be F with all vertices 
shifted by re. Consider the two indicators / = 1[H D F] and /„, = 1[H ~D F n ]. 
Since I n is ^-measurable, 

P(H d F U F n ) = E(// n ) = E(E(7 | F n )I n ). (5.5) 

Moreover, E(J | .T"^), n = 1, 2, . . . , is a reverse martingale, and thus a.s. 

00 

E(J|.F n )-»E(l| f|^ n ) = EJ, 

71=1 

using (iv). Hence, (E(J | .F n ) — El)l n — ► a.s., and by dominated conver- 
gence 

e((E(I |^„)-E/)/ n ) -.0. 

Consequently, (|5.5p yields 

P(# D F U F n ) = E IE I n + o(l) = F(H D F) F(H D F n ) + o(l). 

Moreover, since H is exchangeable, P(i7 D F U F n ) (for n > v(F)) and 
P(iT D F n ) do not depend on n, and we obtain as n — ► 00 

P(# D FUFfc) = P(# D F) 2 . (5.6) 

Let r be a random element of Uoq corresponding to H as in Theorem 15.31 
By (JO!) and (pT2]l . can be written 

Et(F,r) 2 = (Et(F,r)) 2 . 

Hence the random variable t(F, T) has variance so it is a.s. constant. Since 
this holds for every F 6 C, it follows that T is a.s. constant, i.e., we can take 



r non-random, and (i) follows by Corollary 15. 4[ □ 



6. Representations of graph limits and exchangeable graphs 

As said in the introduction, the exchan gea ble infinite random graphs were 
characterized by Aldous [1] and Hoover [141 ]. see also Kallenberg [151], and 
the graph limits in 11^ = U* were characterized in a very similar way by 
Lovasz and Szegedy [191 ]. We can now make the connection between these 
two characterizations explicit. 

Let W be the set of all measurable functions W : [0, l] 2 — » [0, 1] and let 
W s be the subset of symmetric functions. For every W € W s , we define 
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an infinite random graph G(oo,W) G Coo as follows: we first choose a se- 
quence Xi, X2, ■ ■ ■ of i.i.d. random variables uniformly distributed on [0, 1], 
and then, given this sequence, for each pair with i < j we draw an 
edge ij with probability W(X{, Xj), independently for all pairs with 
i < j (conditionally given {Aj}). Further, let G(n,W) be the restriction 
G(oo, W)|[ n ], which is obtained by the same construction with a finite se- 
quence X\ , . . . , X n . 

It is evident that G(oo, W) is an exchangeable infinite random graph, and 
the result by Aldous and Hoover is that every exchangeable infinite random 
graph is obtained as a mixture of such G(oo, W); in other words as G(oo, W) 
with a random W. 

Considering again a deterministic W G W s , it is evident that Theo- 
rem l5.5f ii) holds, and thus Theorem l5.5l and Corollarv l5.4l show that G(oo, W) 
corresponds to an element G Uoo- Moreover, by Theorem 15.31 and Re- 
mark [5J3 G(n,W) — > T\y a.s. as n — > 00, and (|5.3p shows that if F € 
then 

t(F,T w )=F(FCG(k,W))= [ TT W{x l ,x 3 )dx l ...dx k . (6.1) 

The main result of Lovasz and Szegedy [19( is that every element of Uoo = U* 
can be obtained as Tw satisfying ()6.ip for some W G W s . 

It is now clear that the representation theorems of Aldous-Hoover [l|, [13] 
and Lovasz and Szegedy [3] are connected by Theorem l5.3l and Corollary [53] 
above, and that one characterization easily follows from the other. 

Remark 6.1. The representations by W are far from unique, see Section [71 
Borgs, Chayes, Lovasz, Sos and Vesztergombi call an element W G W s a 
graphon. They further define a pseudometric (called the cut- distance) on W s 
and show that if we consider the quotient space W s obtained by identifying 
elements with cut-distance 0, we obtain a compact metric space, and the 
mapping W 1— > T\y yields a bijection W s — > U* = Uoo, which furthermore is 
a homeomorphism. 



Remark 6.2. As remarked in Lovasz and Szegedy 19|], we can more gen- 
erally consider a symmetric measurable function W : S 2 —> [0, 1] for any 
probability space (5,/x), and define G(oo,W) as above with Xi i.i.d. ran- 
dom variables in S with distribution [i. This does not give any new limit 
objects G(oo, W) or Tw, since we just said that every limit object is obtained 
from some W G W s , but they can sometimes give useful representations. 

An interesting case is when W is the adjacency matrix of a (finite) graph 
G, with S = V(G) and fi the uniform measure on S; we thus let Xi be i.i.d. 
random vertices of G and G(n, W) equals the random graph G[n] defined in 
Section El It follows from (EH]) and ([2Tj that t(F,F w ) = t(F, G) for every 
F G U, and thus T\y = G as elements of U* . In other words, Tw G Uoo = 
tt(G), the "ghost" of G in Uoo = U* . 
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Remark 6.3. For the asymptotic behavior of G(n, W) in another, sparse, 
case, with W depending on n, see 

7. NON-UNIQUENESS 

The functions W on [0, l] 2 used to represent graph limits or exchangeable 
arrays are far from unique. (For a special case when there is a natural 
canonical choice, which much simplifies and helps applications, see [l~2j].) 
For example, it is obvious that if <p : [0, 1] — > [0, 1] is any measure preserving 
map, then W and W o tp, defined by W o tp(x,y) := W\tp{x), cp{y)) , define 
the same graph limit and the same (in distribution) exchangeable array. 

Although in principle, this is the only source on non-uniqueness, the de- 
tails are more complicated, mainly because the measure preserving map 
tp does not have to be a bijection, and thus the relation W' = W o ip 
is not symmetric: it can hold without there being a measure preserving 
map tp' such that W = W o tp'. (For a 1-dimensional example, consider 
f(x) = x and f'(x) = tp{x) = 2x mod 1; for a 2-dimensional example, let 
W(x, y) = f(x)f(y) and W'(x, y) = f'(x)f'(y).) 

For exchangeable arrays, the equivalence problem was solved by Hoover 



[14(, who gave a criterion which in our case reduces to (vi) below; this 
criterion involves an auxiliary variable, and can be interpreted as saying 
W = W' o cp' for a random tp'. This work was continued by Kallenberg, see 



151 ] , who gave a probabilistic proof and added criterion (v) For graph limits, 

in 



Borgs, Chayes, Lovasz, Sos and Vesztergombi [0] gave the criterion (vii / 1 
terms of the cut-distance, and Bollobas and Riordan [1] found the criterion 
(v) in this context. Further, Borgs, Chayes, Lovasz, Sos and Vesztergombi 
[8[ announced the related criterion that there exists a measurable function 
U : [0,1] 2 — ► [0,1] and two measure preserving maps (p,(p' : [0,1] — > [0,1] 
such that W = U o (p and W = U o (p' a.e.; the proof of this will appear in 
@. 

As in Section [61 these two lines of work are connected by the results in 
Section [5l and we can combine the previous results as follows. 

Theorem 7.1. Let W,W € W s . Then the following are equivalent. 

(i) T\y = T\y> for the graph limits Ty/i^w G ^oo- 

(ii) t(F,Tw) = t(F,Tw') for every graph F. 

(iii) The exchangeable random infinite graphs G(oo, W) and G(oo,W) 
have the same distribution. 

(iv) The random graphs G(n, W) and G(n, W) have the same distribu- 
tion for every finite n. 

(v) There exist measure preserving maps <p, <p' : [0, 1] — > [0, 1] such that 
Woip = Wo(p> a.e. on [0, l] 2 , i.e., W(<p(x),(p(y)) = W (ip' \x) , (p' (y)) 
a.e. 

(vi) There exists a measure preserving map ip : [0, l] 2 — > [0, 1] such that 
W(x 1 ,x 2 ) = W'(ifj(x 1 ,yi),tp(x2,y2)) a.e. on [0, l] 4 . 

(vii) 5\j(W,W) = 0, where 5\j is the cut-distance defined in f^j. 



18 



PERSI DIACONIS AND SVANTE JANSON 



Proof, (i) 



in 



(ii) By our definition of C U. 



(iii)| By Corollary EH 
Obvious. 

If Xi, X2, ■ ■ ■ are i.i.d. random variables uniformly dis- 



iv 
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tributed on [0,1], then so are cp(Xx), (piX^) 
G(po, Wo<p)= G(oo, W o if/) = G(oo, W). 



and thus G(oo,W) 



_ fiii) ==>■ (v) The general form of the representation theorem as stated in 
|15l. Theorem 7.15, see also p. 304] is (in our two-dimensional case) Xy = 
/(6, 6, 6> 6.7) f° r a function / : [0, l] 4 — > [0, 1], symmetric in the two middle 
variables, and independent random variables 6, 6 (1 < i) and 6j (1 < i < 
j), all uniformly distributed on [0,1], and where we further let 6'j = 6j for 
jf > i. We can write the construction of G(oo, W) in this form with 

/(^Cj^i) = l&j < ^(6, 0)]- (7.1) 

Note that this / does not depend on £0. (In general, £0 is needed for the 
case of a random W, which can be written as a deterministic function of £0, 
but this is not needed in the present theorem.) 

Suppose that G(oo, W) = G(oo, W), let / be given by W by ([EI]), and let 
similarly /' be given by W'; for notational convenience we write W\ := W, 
W2 '■= W , fx ■= f and /2 := /'. The equivalence theorem flil . Theorem 
7.28] takes the form, using (|7.1j) . that there exist measurable functions gk,o '■ 
[0,1] -> [0,1], 5M : [0,1] 2 [0,1] and g K2 : [0, l] 4 [0,1], for k = 1,2, 
that are measure preserving in the last coordinate for any fixed values of the 
other coordinates, and such that the two functions (for k = 1 and k = 2) 

/fc(#fc,o(£0)'5fc,i(£0!£i)>Sfc,i(£0> £2), 5fc,2(6> 6>6>62)) 

= 1 [Wk{9k,i(Z$,€i),9k,i(€i>i&)) - 5/0,2(6,6,6,62)] 

are a.s. equal. Conditioned on £0,6 and 6, the random variable 5^,2(6, 6, £2, 62) 
is uniformly distributed on [0,1], and it follows (e.g., by taking the condi- 
tional expectation) that a.s. 

Wl (51,1(6, 6), (£0, 6)) = ^2(52,1(6, 6), 52,i(6, &))■ 

For a.e. value xo of 6, this thus holds for a.e. values of 6 an d 6, an d we 
may choose ip(x) = 51,1(2:0, ^) an d ^p'(x) := g2,i(xo,x) for some such xq. 
Similar, using (lil . Theorem 7.28(iii)]. 
See 1. □ 



VI 



(vii) 



8. Bipartite graphs 

The definitions and results above have analogues for bipartite graphs, 
which we give in this section, leaving some details to the reader. The proofs 
are straightforward analogues of the ones given above and are omitted. Ap- 
plications of the results of this section to random difference graphs are in 
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A bipartite graph will be a graph with an explicit bipartition; in other 
words, a bipartite graph G consists of two vertex sets V\(G) and V 2 (G) and 
an edge set E(G) C V\(G) x V 2 (G); we let vi(G) := \Vx(G)\ and v 2 (G) := 
\V2(G)\ be the numbers of vertices in the two sets. Again we consider both 
the labelled and unlabelled cases; in the labelled case we assume the labels 
of the vertices in Vj(G) are 1, . . . , Vj(G) for j = 1, 2. Let B^ in be the set 
of the 2" -1 ™ 2 labelled bipartite graphs with vertex sets [i%x] and [n 2 ], and 
let B riiri2 be the quotient set B^ lU2 / = of unlabelled bipartite graphs with 
n\ and n 2 vertices in the two parts; further, let B L := (J >x B% in2 and 

& '■= Uni,n 2 >l ^™i™2- 

We let G[/ci,/c2] be the random graph in B^ ik2 obtained by sampling 
kj vertices from Vj(G) (j = 1,2), uniformly with replacement, and let, 
provided kj < Vj(G), G[k±, k 2 ]' be the corresponding random graph obtained 
by sampling without replacement. We then define t(F,G), t; n j(F, G) and 
tmd(F,G) for (unlabelled) bipartite graphs F and G in analogy with (|2.1j) 
(|2.3j) . Then (|2.4j) - (12.6p still hold, mutatis mutandis; for example, 

In analogy with (|2.7p . we now define r : B -> [0, if by 

r(G) :=(t(F,G)) FeB G [0, (8.2) 

We define B* := r(B) C [0, \] B to be the image of B under this mapping 
r, and let B* be the closure of B* in [0, 1] B ; this is a compact metric space. 

Again, r is not injective; we may consider a graph G as an element of B* 
by identifying G and t(G), but this implies identification of some graphs of 
different orders and we prefer to avoid it. We let B + be the union of B and 
some two-point set {*i, *2} and consider the mapping r + : B — » [0, = 
[0, 1] B x [0, 1] x [0, 1] defined by 

t + (G) = {t(G),v 1 (G)- 1 ,v 2 (G)- 1 ). (8.3) 

Then r + is injective and we can identify B with its image t + {B) C [0, 
and define B C [0, as its closure; this is a compact metric space. 

The functions t(F, •), ti n j(F, •), ti n d(i ? , ■) and Vj(-) , for F E S and j = 
1,2, have unique continuous extensions to B. 

We let #0000 := {G G B : v\{G) = v 2 (G) = 00}; this is the set of all 
limit objects of sequences (G n ) in B with vi(G n ),v 2 (G n ) — > cxo. By (|8.1h . 
ii n j(F, G) = t(F,G) for every G € £>oooo and every F £ B. The projection 
7r : £> — > restricts to a homeomorphism ^cooo — B* . 

Remark 8.1. Note that in the bipartite case there are other limit objects 
too in B; in fact, B can be partitioned into B, Boooo, and the sets B noo , £>oon, 
for n = 1, 2, . . . , where, for example, B ni00 is the set of limits of sequences 
(G n ) of bipartite graphs such that v 2 (G n ) — > 00 but t>i(G n ) = ni is constant. 
We will not consider such degenerate limits further here, but we remark that 
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in the simplest case n\ = 1, a bipartite graph in £>f n2 can be identified with 
a subset of [712], and an unlabelled graph in B\ n2 thus with a number in 
m G {0, . . . , n.2}, the number of edges in the graph, and it is easily seen that 
a sequence of such unlabelled graphs with 77,2 — > 00 converges in £> if and 
only if the proportion rajni converges; hence we can identify Bx^ with the 
interval [0,1]. 

We have the following basic result, cf. Theorem 12.11 

Theorem 8.1. Let (G n ) be a sequence of bipartite graphs with vi(G n ), 
V2{G n ) —* °o. Then the following are equivalent. 

(i) t(F, G n ) converges for every F G B. 

(ii) ti n j(F, G n ) converges for every F G B. 

(iii) ti n i(F, G n ) converges for every F G B. 

(iv) G n converges in B. 



In this case, the limit G of G n belongs to *Boooo 

and the limits in mTu |(iii) 

and (iii) are t(F, G), U n j(F, G) and ti n d(F, G). 



For convergence of random unlabelled bipartite graphs, the results in 
Section [3] hold with trivial changes. 

Theorem 8.2. Let G n , n > 1, be random unlabelled bipartite graphs and 
assume that vi(G n ), V2(G n ) — > 00. The following are equivalent, as n — > 00. 

(i) G n — — ► r for some random V G B. 

(ii) For every finite family F±, . . . , F m of (non-random) bipartite graphs, 
the random variables t(Fi, G n ), ■ ■ ■ , t(F m , G n ) converge jointly in dis- 
tribution. 

(iii) For every (non-random) F G B, the random variables ti^F, G n ) con- 
verge in distribution. 

(iv) For every (non-random) F G B, the expectations Mt(F,G n ) con- 
verge. 

1 m 



If these properties hold, then the limits in (ii) (iii) and (iv) are T)) 
t(F,T) andEt(F,T), respectively. Furthermore, T G £>oooo a.s. 
The same results hold if t is replaced by t in j or i inc j . 

Corollary 8.3. Let G n , n > 1, be random unlabelled bipartite graphs such 
that v%(G n ), V2(G n ) — oo, and let G G £?oooo- The following are equivalent, 
as n — > 00. 

(i) G n G. 

(ii) t(F,G n ) — — > t(F,G) for every (non-random) F G £>. 

(iii) Et(F, G n ) — ► t(F,G) for every (non-random) F G £>. 

T7ie same result holds if t is replaced by t{ n $ or t- m ^. 

As above, the distribution of T is uniquely determined by the numbers 

Ei(F,r), f e B. 
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Let iJ^oo denote the set of all labelled infinite bipartite graphs with the 
vertex sets V\(G) = V^G) = N. B^^ is a compact metric space with the 
natural product topology. 

If G is a bipartite graph, let G be the random labelled bipartite graph 
obtained by random labellings of the vertices in Vj(G) by the numbers 
1, . . . ,Vj(G), for j = 1,2. This is a random finite bipartite graph, but we 
can also regard it as a random element of B^^ by adding isolated vertices. 

Definition. A random infinite bipartite graph H G B^^ is exchangeable if 
its distribution is invariant under every pair of finite permutations of V\ (H) 
and V 2 (H). 

Theorem 8.4. Let (G n ) be a sequence of random graphs in B and assume 
that vi(G n ),V2(G n ) — !— ► oo. Then the following are equivalent. 

(i) G n —> F in B for some random F G B. 

(ii) G n H in B^^ for some random H G S^oo- 

// these hold, then P(fT| [fcl]x[fca] = F) = Et ind (F,r) /or ewery F G B^. 
Furthermore, F G i?oooo g-s v and if is exchangeable. 

Theorem 8.5. There is a one-to-one correspondence between distributions 
of random elements F G £>oooo (or B*) and distributions of exchangeable 
random infinite graphs H G B^^ given by 

Et ind (F,F)=¥(H\ [kl]x[k2] =F) (8.4) 
for every fci, &2 > 1 and every F G B kik2 , or, equivalently, 

Et(F, T) = P(H D F) (8.5) 

for every F G B L . Furthermore, iT|r ni i x r n2 i F in B as n±,ri2 — > oo. 

Corollary 8.6. There is a one-to-one correspondence between elements F 
of Boooo = B* and extreme points of the set of distributions of exchangeable 
random infinite graphs H G B^^. This correspondence is given by 

t(F, F) = F(H D F) (8.6) 
for every F G B L . Furthermore, i7|r ni ] x r n2 ] — !— > F in B as n±,n2 — ► oo. 

Remark 8.2. We have not checked whether iT|[ ni ] x [ n2 ] r in 6 as 

n\,n 2 — ► oo. This holds at least for a subsequence (ni(m), n2(m)) with 
both m(m) and n2(m) non-decreasing because then ti n j(F, #1 [m] x [n 2 ] ) i s a 
reverse martingale. 

Theorem 8.7. Lei ii 6e an exchangeable random infinite bipartite graph. 
Then the following are equivalent. 

(i) The distribution of H is an extreme point in the set of exchangeable 
distributions in Bjt <nn . 
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(ii) If F\ and F 2 are two (finite) bipartite graphs with the vertex sets 
Vj{F\) and Vj{]?2) disjoint subsets o/N for j = 1,2, then 



The construction in Section [6] takes the following form; note that there 
is no need to assume symmetry of W. For every W G W, we define an 
infinite random bipartite graph G(oo,oo,W) € fi^oo as follows: we first 
choose two sequence X\,X%,... and Ij,!^,... of i.i.d. random variables 
uniformly distributed on [0,1], and then, given these sequences, for each 
pair € N x N we draw an edge ij with probability W(Xi,Yj), in- 

dependently for all pairs (i,j). Further, let G(ni,ri2,W) be the restriction 
G(oo, 00, W / )|[ni]x[n 2 ]i which is obtained by the same construction with finite 
sequences X\ , . . . , X ni and Y\ , . . . , Y n2 . 

It is evident that G(oo,oo,W) is an exchangeable infinite random bi- 
partite graph. Furthermore, it satisfies Theorem I8.7|(ii)| Theorem 18.51 and 
Corollary 18.61 yield a corresponding element T'^y € Sqoqo = B* such that 
G(ni,n 2 , W) T'Jy as ni,n 2 — > 00 and, for every F G Bj: ' k , 



The result by Aldous [l[ in the non-symmetric case is that every ex- 
changeable infinite random bipartite graph is obtained as a mixture of such 
G(oo, 00, W); in other words as G(oo, 00, W) with a random W . 

By Theorem 18.51 and Corollary 18.61 above, this implies (and is implied by) 
the fact that every element of B equals T'^ for some (non-unique) W £ W; 
the bipartite version of the characterization by Lovasz and Szegedy . 



A directed graph G consists of a vertex set V(G) and an edge set E{G) C 
V(G) x V(G); the edge indicators thus form an arbitrary zero-one matrix 
{Xij}, i,j € V{G). Note that we allow loops, corresponding to the diagonal 
indicators Xa. The definitions and results above have analogues for directed 
graphs too, with mainly notational differences. We sketch these in this 
section, leaving the details to the reader. 

Let be the set of the 2 n labelled directed graphs with vertex set [n] 
and let V n be the quotient set T>^ / = of unlabelled directed graphs with n 
vertices; further, let V L := \J n>1 and V := \J n>1 V n . 

The definitions in Section ETapply to directed graphs too, with at most 
notational differences. G[k] and G[k]' now are random directed graphs and 
t(F, G), ti n j(F, G) and t- m( ±(F, G) are defined for (unlabelled) directed graphs 
F and G by l^-l^. We now define r : V -> [0, l] v by, cf. flZZD, 



F{H D Ft U F 2 ) = F{H D F{)¥(H D F 2 ). 




(8.7) 



9. Directed graphs 



t(G) := (t(F,G)) FeV €[0,1] 



(9.1) 
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We define V* := t(V) C [0, l] v to be the image of T> under this mapping r, 
and let T>* be the closure of V* in [0, l] v ; this is a compact metric space. 

Again, r is not injective. We let T> + be the union of V and some one- 
point set {*} and consider the mapping r + : T> — > [0, l] v+ = [0, l] 25 x [0, 1] 
defined by (|2.9p as before. Then r + is injective and we can identify T> with 
its image t + (T>) C [0, and define T> C [0, 1] I,+ as its closure; this is a 
compact metric space. The functions t(F,-), t; n j(F, •), t in( ^(F, •) and u(-) , 
for F (iV, have unique continuous extensions to D. 

We let T>oo := {G G D : f (G) = oo}; this is the set of all limit objects 
of sequences (G n ) in V with v{G n ) -> oo. By (JH]), t in j(F,G) = t(F, G) for 
every G G 2?oo and every F £ T>. The projection 7r : T> — > V* restricts to a 
homeomorphism = T>*. 

All results in Sections [2H5] are valid for directed graphs too, with at most 
notational differences. 

The main difference for the directed case concerns the representations 
discussed in Section [6j Since two vertices may be connected by up to two 
directed edges (in opposite directions), and the events that the two possible 
edges occur typically are dependent, a single function W is no longer enough. 
Instead, we have a representation using several functions as follows. 

Let W5 be the set of quintuples W = (Wqo, W01, Wio, Wu, w) where 
W a p : [0, l] 2 — > [0, 1] and w : [0, 1] — > {0, 1} are measurable functions such 
that T,a,/3=o W al3 = 1 and W a/3 (x,y) = Wp a (y,x) for a,/3 G {0,1} and 
x,y € [0,1]. For W E W5, we define a random infinite directed graph 
G(oo, W) by specifying its edge indicators Xij as follows: we first choose a 
sequence Yy, Y2, . . . of i.i.d. random variables uniformly distributed on [0, 1], 
and then, given this sequence, let Xa = wiYi) and for each pair with 
i < j choose Xij and Xji at random such that 

F(Xij = a and X j{ = 0) = Wa^Yj), a, fie {0, 1}; (9.2) 

this is done independently for all pairs with i < j (conditionally given 
{li}). In other words, for every i we draw a loop at i if w(Yi) = 1 and for 
each pair with i < j we draw edges ij and ji at random such that 

(19. 2p holds. Further, let G(n, W) be the restriction G(oo, W)|r n i, which is 
obtained by the same construction with a finite sequence Y±, . . . ,Y n . 

In particular, note that the loops appear independently, each with prob- 
ability p = F(w(Yi) = 1). We may specify the loops more clearly by the 
following alternative version of the construction. Let S := [0, 1] x {0, 1} 
and let W4 be the set of quadruples W = (Woo, W01, Wio, Wu) where 
W a p : S 2 — > [0, 1] are measurable functions such that Yla p=o W a /3 = 
1 and W a p(x,y) = Wp a {y,x) for a, (3 € {0,1} and x,y € S. For ev- 
ery W G W4 and p G [0,1], we define a random infinite directed graph 
G(oo,W,p) by specifying its edge indicators X^ as follows: We first choose 
sequences £1, £2, • • • and Cl> C2> • • ■ of random variables, all independent, with 
& ~ U{0, 1) and Q ~ Be(p), i.e., & G {0, 1} with P(& = 1) = p; we let 
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Yi := (^i)Ci) £ <5- Then, given these sequences, let JQj = £j and for each 
pair with i < j choose X{j and Xji at random according to (|9.2f) . 

independently for all pairs with i < j (conditionally given {1^}). In 
other words, Q is the indicator of a loop at i. Further, let G(n, W,p) be the 
restriction G f (oo,W,j>)[r fl i, which is obtained by the same construction with 
a finite sequence Yi , . . . , Y n . 

It is obvious from the symmetry of the construction that the random 
infinite directed graphs G(oo, W) and G(oo,W,p) are exchangeable. Fur- 
ther, using Theorem 15 .5^ their distributions are extreme points, so by Corol- 
lary |5.4l thev correspond to directed graph limits, i.e., elements of Poo, which 
we denote by Tw and T~w tP , respectively; ()5.3[) shows that if F G then 

t(F, r w ) = C W)) , t(F, r w , P ) = P(F C G(k, W,p)) . 

By Theorem 15.31 and Remark 15. 1[ G(n, W) — > Tw and G(n,W,p) — > r-w,p 
a.s. as n — > oo. 

We can show a version of the representation results in Section [6] for di- 
rected graphs. 

Theorem 9.1. An exchangeable random infinite directed graph is obtained 
as a mixture o/G(oo,W); in other words, as G(oo, W) with a random W. 
Alternatively, it is obtained as a mixture of G(oo, W,p); in other words, as 
G(oo, W,p) with a random (W,p). 

Every directed graph limit, i.e., every element of T> oa , is Tw for some 
W G Ws, or equivalently F\y,p /or some W G W4 and p G [0, 1]. 

Proof. For jointly exchangeable random arrays {-Xy} of zero-one variables, 
the Aldous-Hoover representation theorem takes the form [l5|, Theorem 
7.22] 

Xu = /i(£0,&), 

where f\ : [0, l] 2 — > {0, 1} and /2 : [0, l] 4 — > {0, 1} are two measurable func- 
tions, £ji = and ^0, & (1 < i) and £jj (1 < i < j) are independent random 
variables uniformly distributed on [0,1] (as in the proof of Theorem 17. ip . 
If further the distribution of the array {X^} is an extreme point in the set 
of exchangeable distributions, then by Theorem 15.51 and [la . Lemma 7.35], 
there exists such a representation where f\ and fi do not depend on £0, so 
Xu = /i(&) and = h{ii,Z,j,£ij), i ^ j- In this case, define w = ft and 

W aP (x,y) :=P(/ 2 (x,y,0 =a and/ 2 (y,z,£) = 0), a,/?G {0,1}, 

where £ ~ J7(0, 1). This defines a quintuple W G W5, such that the edge 
indicators X^j of G(oo, W) have the desired distribution. 

In general, the variable £g can be interpreted as making W random. 

To obtain the alternative representation, let Q := w(£i) = Xu and p := 
P(Ci = !)• There exists a measure preserving map <f> : (S,fi p ) — > [0,1] such 
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that [0,1] x {j} is mapped onto {x G [0,1] : w(x) = j} for j = 0,1 (i.e., 
we can use the quadruple {W a p o (f>) a ^. 
The representations for graph limits follow by Corollary 15.41 as discussed 
above. □ 

Example 9.2. A random tournament T n is a random directed graph on n 
vertices without loops where each pair of vertices is connected by exctly one 
edge, with random direction (with equal probabilities for the two directions, 
and independent of all other edges). This equals G(n,W) or G(n,W,p) 
with Woo = Wi\ = 0, Wqi = Wiq = 1/2, and w = or p = 0, and converges 
thus a.s. to the limit Tw.o for W = {W a p) a ^. 

Note that if {Xij} are the edge indicators of an exchangeable random 
infinite directed graph, then the loop indicators {Xjj} form a binary ex- 
changeable sequence, and the representation as G(oo, W,p) in Theorem 19. II 
exhibits them as a mixture of i.i.d. Be(p) variable, which has brought us 
back to deFinetti's theorem ll.il 
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